SambaLingo-Turkish-Base is a bilingual (Turkish and English) model based on Llama-2-7b pre-training, adapted for Turkish by training on 42 billion tokens from the Turkish portion of the Cultura-X dataset.
Large Language Model
Transformers Supports Multiple Languages